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SIMPLE TYPES IN XML SCHEMA COMPLEX TYPES 



5 BACKGROUND OF INVENTION 

This invention relates to allowing the presence of a simple type in an XML Schema complex 
type. 

10 Extensible mark-up language (XML) is a mark-up language for documents containing 

structured information. XML provides a facility to define tags and the structural relationship 
between them in the document. An XML Schema describes and constrains the content of 
XML documents. 

15 The W3C XML Schema does not permit a complex type to include a simple type without 

defining a new element. Also if a complex type inherits from a Simple Type then it can have 
only simple content, i.e. it cannot have child elements. As a result, it is not possible to model 
accurately the XML Schema for the following XML stream where the contents of element 
Fred contains an arbitrary string (xxxx) , an element Joe, an integer (67890102), an 

20 element Jim and finally an integer (12345). 

<Fred>xxxx<Joe/>67890102<Jim/>12345</Fred> 

Such types of XML Streams are very common. The XML Schema specification does allow a 
25 mixed flag to be set in respect of a complex type so that an instance of the complex type can 

contain arbitrary strings between the elements. Nevertheless, you cannot precisely specify 
any order or position for the strings within the XML document, as illustrated in the example 
above. 

30 This invention seeks to avoid some of the described drawbacks. 
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SUMMARY OF INVENTION 

The present invention describes an XML Schema and a parser for the aforedescribed kinds of 
5 XML streams. More specifically, the subject invention seeks to, within the scope of existing 

XML Schema specifications, provide an approach to embed simple types within a complex 
type so an XML stream can have instances of the embedded simple types, even without the 
presence of XML elements. Because the approach is within the scope of existing XML 
Schema specifications, the approach is transparent to known XML parsers. 

10 

According to the present invention, there is provided an XML schema, comprising: 
at least one complex type, said complex type having a mixed flag set to true and including an 
element set to be a dummy element, said dummy element having a simple type, one of (i) a 
name of said dummy element and (ii) a name of said simple type being one of a 
15 predetermined set of names for use with a parser which, on parsing said schema, will 

interpret said dummy element as one of an embedded simple type and an inherited simple 
type. 

According to another aspect of the present invention, there is provided a method of parsing, 
20 comprising: receiving an XML stream; parsing said XML stream on encountering a parent 

element in said XML stream, utilising an XML schema to locate a type for said parent 
element; where said type is a complex type, determining whether a mixed flag for said 
complex type in said schema is set to true; where said mixed flag is set to true, interpreting 
fragments embedded in said parent element in accordance with said complex type, each 
25 fragment being one of an arbitrary string and an element; where, in accordance with said 

complex type, an embedded fragment corresponds to a dummy element having a simple type, 
with one of (i) a name of said dummy element and (ii) a name of said simple type being one 
of a predetermined set of names, interpreting said embedded fragment as one of an embedded 
simple type and an inherited simple type. A computer readable medium containing computer 
30 executable instructions to affect the method is also provided. 
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According to a further aspect of the present invention, there is provided a parser, comprising: 
means for receiving an XML stream; means for parsing said XML stream means for, on 
encountering a parent element in said XML stream, utilising an XML schema to locate a type 
for said parent element; means for, where said type is a complex type, determining whether a 
5 mixed flag for said complex type in said schema is set to true; means for, where said mixed 

flag is set to true, interpreting fragments embedded in said parent element in accordance with 
said complex type, each fragment being one of an arbitrary string and an element; means for, 
where, in accordance with said complex type, an embedded fragment corresponds to a 
dummy element having a simple type, with one of (i) a name of said dummy element and (ii) 
10 a name of said simple type being one of a predetermined set of names, interpreting said 

embedded fragment as an embedded simple type. 



Other features and advantages of the invention will become apparent from the following 
description in conjunction with the drawings. 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 



The figure, which illustrates an example embodiment of the invention, is a schematic view of 
a system configured in accordance with this invention. 

20 

DETAILED DESCRIPTION 



The following approach is used so that, within the scope of existing XML Schema 
specifications, a mechanism/pattern is provided to embed simple types within a complex type 
25 in order that an XML stream can have instances of the embedded simple types, even without 

the presence of XML elements. 

1. Custom simple types are used which extend the XML Schema simple types so a parser 
operating in accordance with this invention can recognize these special types. Each of these 
30 simple types has a name such that the set of these custom simple types provides a set of pre- 

determined names. 
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2. The mixed flag is set to true on the embedding complex type. ~ As per XML Schema 
specifications, the mixed flag on the complex type indicates that its instance can have 
arbitrary strings in between the elements. 

5 

3. Elements or arbitrary strings of these custom simple types are created within the 
embedding complex type. These elements/arbitrary strings are wrapped under an anonymous 
group sequence in which minOccurs and maxOccurs on the sequence is set to 0. As per the 
XML Schema specification, if both of these flags are set to 0 then the XML instance of the 

1 0 complex type cannot have any occurrence of contents of sequence. In other words, such 

elements are dummy elements and logically they are not part of the complex type. In 
consequence, an XML parser which is not configured to operate in accordance with this 
invention will simply ignore this group. It is noted that the minOccurs and maxOccurs on the 
elements of these custom simple types have their usual meaning, i.e., they determine whether 

1 5 these elements are optional or mandatory. 

By combining these constructs , a special case is created for the custom XML parser (i.e., a 
parser operating in accordance with this invention) to treat these dummy elements as just the 
simple types and to parse the arbitrary strings in the instance (a by-product of the mixed 
20 declaration defined on the embedding complex type) according to the defined type of the 

element. 

Since the declaration of these dummy elements, wrapped under an anonymous group within 
the complex type, may be positional (by use of the "sequence" construct), the custom XML 
25 parser can ensure that such arbitrary strings in the XML instance adhere to the relative 

position of the declaration of these dummy elements in the complex type. 

The XML instance generated for the complex type (containing these dummy elements) will 
parse successfully by any open source XML Schema complaint parser because it will contain 
30 arbitrary strings at the position where the dummy elements are defined; this is OK because 

the mixed flag on the complex type is set to true. 
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The following schema contains Customized Simple types to enable embedding of a simple 
type within a complex type and extending from a simple type. 

5 • Types starting with name ComlbmMrmBaseValuexxxx are to be used in the scenario 

where a complex type extends a simple type. 
• Types starting with name ComlbmMrmAnonxxx are to be used in the scenario where a 
complex type includes a simple type. 

1 0 The reference to "wmqi21" in the example is simply a reference to the customized parser (i.e., 
the parser operating in accordance with this invention). 
<?xml version="1.0" encoding="UTF-8 ,, ?> 

<xsd : schema xmlns : xsd= " ht tp : / /www . w3 . org/ 2 001 /XMLSchema " > 

1 5 <xsd : annotation> 

<xsd : document at ion> 

ComIbmMrm_BaseValuexxx type to be used when a complex 
type extends a simple type 

< /xsd : document at ion> 
20 </xsd:annotation> 

<xsd : simpleType name= "ComIbmMrm_BaseValueString" > 

<xsd : restriction base= "xsd : string" ></xsd : restrict ion> 
</xsd : simpleType> 



25 



<xsd : simpleType name= "ComIbmMrm_BaseValueInt " > 
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<xsd : restriction base= "xsd : int " x/xsd : restriction 
</xsd: simpleType> 

<xsd : simpleType name= "ComIbmMrm_BaseValueDateTime " > 

<xsd : restriction base= "xsd : dateTime "x/xsd : restriction 
</xsd: simpleType> 

<xsd: simpleType name="ComIbmMrm_BaseValueDate"> 

<xsd : restriction base= "xsd : date "x/xsd : restriction 
</xsd: simpleType> 

<xsd : simpleType name= "ComIbmMrm_BaseValueTime 11 > 

<xsd : restriction base= "xsd : time 11 x/xsd : restriction 
</xsd: simpleType> 

<xsd: simpleType name="ComIbmMrm_BaseValueFloat"> 

<xsd : restriction base= "xsd : float "x/xsd : restriction> 
</xsd: simpleType> 

<xsd : simpleType name= " ComIbmMrm__BaseValueDecimal 11 > 

<xsd : restriction base= "xsd : decimal "x/xsd : restriction 
</xsd : simpleType> 

<xsd : simpleType name= "ComIbmMrm_BaseValueBoolean" > 
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<xsd : restriction base= "xsd : boolean" ></xsd : restriction 
</xsd : simpleType> 

<xsd: simpleType name= ,I ComIbmMrm_BaseValueBinary ,l > 
5 <xsd : restriction base= "xsd : hexBinary " ></xsd : restriction 

</xsd : simpleType> 

<xsd: simpleType name="ComIbmMrmJBaseValueGYear"> 

<xsd : restriction base= "xsd : gYear 11 ></xsd : restriction> 
10 </xsd:simpleType> 

<xsd : simpleType name= " ComIbmMrm_BaseValueGYearMonth" > 
<xsd : restriction 
base= "xsd : gYearMonth" ></xsd : restriction> 
15 </xsd: simpleType> 

<xsd: simpleType name="ComIbmMrm_BaseValueGMonth"> 

<xsd : restriction base= "xsd : gMonth" > < /xsd : restrict ion> 
</xsd : simpleType> 

20 

<xsd : simpleType name= "ComIbmMrm_BaseValueGMonthDay" > 

<xsd : restriction base= "xsd : gMonthDay " ></xsd : restriction> 
</xsd : simpleType> 
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<xsd : simpleType name= " ComIbmMrm_BaseValueGDay " > 

<xsd : restriction base= "xsd : gDay " ></xsd : restrict ion> 
</xsd : simpleType> 

<xsd : annotation> 

<xsd : documentation> 

ComIbmMrm_Anonymous_xxx type to be used when a 
complex type includes a simple type 
</xsd : document at ion> 
</xsd : annotation> 

<xsd : simpleType name= ,l ComIbmMrm_AnonString" > 

<xsd : restriction base= "xsd : string" ></xsd : restriction> 
</xsd: simpleType> 

<xsd : simpleType name= 11 ComIbmMrm_AnonInt " > 

<xsd : restriction base= "xsd : int " >< /xsd : restriction> 
< /xsd : simpleType> 

<xsd: simpleType name="ComIbmMrm_AnonDateTime"> 

<xsd : restriction base= "xsd : dateTime " ></xsd : restriction> 
</xsd : simpleType> 



<xsd : simpleType name= "ComIbmMrm_AnonFloat " > 
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<xsd : restriction base= "xsd : float » x/xsd : restriction 
</xsd: simpleType> 

<xsd : simpleType name= "ComIbmMrm_AnonDecimal " > 
5 <xsd : restriction base= "xsd : decimal " ></xsd : restriction> 

</xsd : simpleType> 

<xsd : simpleType name= "ComIbmMrm_AnonBoolean" > 

<xsd : restriction base= "xsd : boolean" >< /xsd : restriction 
10 < /xsd : simpleType> 

<xsd : simpleType name= " ComIbmMrm_AnonBinary " > 

<xsd : restriction base= "xsd : hexBinary 11 ></xsd : restrict ion> 
</xsd: simpleType> 
15 < /xsd : schema> 

Example: The following example shows the equivalent XML Schema representation for a 
complex type, which extends a simple type String and includes a simple type Integer. 
<?xml version="1.0" encoding="UTF-8"?> 

20 

< s chema xmlns = " ht tp : / /www . w3 . or g / 2 0 0 1 /XMLS chema " 
targe tNamespace= "http : //www . ibm . com" 
xmlns :wmqi21Example="http: //www. ibm. com" > 
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<annotation> 

<documentation> 

MRMSequenceExtendingBaseTypeAndlncludingSimpleType 
MRMChoiceExtendingBaseTypeAndlncludingSimpleType 

They extend from a simple type String and include 
simple type Integer 

</documentation> 
< / anno t a t i on > 

<include schemaLocation= n wmqi21 .xsd" /> 
<complexType 

name= "MRMSequenceExtendingBaseTypeAndlncludingSimpleType " 
mixed= "true "> 

< sequence > 

< sequence minOccurs="0" maxOccurs="0"> 
< element 

name="dummy_elem_string_inheritence n 

type= "wmqi21Example : ComIbmMrm_BaseValueString" minOccurs= " 1 " 
maxOccurs= " 1 ">< /element > 

</sequence> 

<element name="eleml" type=" string" >< /element > 
<sequence minOccurs="0" maxOccurs= n 0"> 
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<element name= "dummy_elem_included_type_int " 
type= " wmqi2 lExample : ComIbmMrm_AnonInt 11 rainOccurs= 11 0 " 
maxOccurs= 11 1 11 ></ element > 

</ sequence > 

<element name= l! elem2" type= 11 integer ">< /element > 
</sequence> 
</complexType> 
<complexType 

name= "MRMChoiceExtendingBaseTypeAndlncludingSimpleType " 
mixed= "true "> 

<sequence> 

<sequence minOccurs^'O" maxOccurs= l! 0"> 
<element 

name="dummy_elem_string_inheritence n 

type= "wmqi2 lExample : ComIbmMrm_BaseValueString" minOccurs= " 1 11 
maxOccurs= 11 1 ">< /element > 

</'sequence> 
<choice> 

<element name="elem3 11 
type= " string ">< /element > 

< sequence minOccurs= n 0" maxOccurs="0"> 
< element 

name= " dummy_elem_included_type_int 11 
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type= "wmqi21Example : ComIbmMrm_AnonInt " minOccurs= " 0 " 
maxOccurs= 11 1 ">< /element > 

</sequence> 
<element name="elem4 " 
5 type= 11 integer ">< /element > 

</choice> 
</sequence> 
< / comp 1 exType > 



10 <element name="mrmSequenceElement " 

type="wmqi21Example:MRMSequenceExtendingBaseTypeAndIncludingSimp 
1 eType " >< / e 1 emen t > 

<element name= "mrmChoiceElement " 
type="wmqi21Example:MRMChoiceExtendingBaseTypeAndIncludingSimple 

1 5 Typ^ 11 > < / element > 

</schema> 

Here is the instance document pertaining to mrmSequenceElement whose type is 
MRMSequenceExtendingBaseTypeAndlncludingSimpleType. Note that the 
20 dummy_elem_string_inheritance mirrors the inheritance from simple type String and 

0123456 mirrors the inclusion of simple type int. 

<?xml versions "1.0" encoding="UTF-8"?> 
<wmqi2 lExample : mrmSequenceElement 
25 xmlns : wmqi2 lExample= " ht tp : / /www . ibm . com" 

xmlns : xs i = " ht tp : / / www . w3 . org / 2 0 0 1 /XMLS chema - ins t ance 11 
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xsi : schemaLocation= "http : / /www. ibm. com 
wmqi21Example.xsd "> 

dummy_elem_string_inheritence 
<eleml>eleml</eleml> 
5 0123456 

<elem2>0</elem2> 
</wmqi21Example : mrmSequenceElement> 



The following is the instance document pertaining to mrmChoiceElement whose type is 
1 0 MRMChoiceExtendingBaseTypeAndlncludingSimpleType. Note that the 

dummy_elem_string_inheritance mirrors the inheritance from simple type String. Due to the 
choice construct, only elem3 is present in the instance document. 

<?xml version="l . 0" encoding= f, UTF-8"?> 
1 5 <wmqi21Example : mrmChoiceElement 

xmlns : wmqi21Example="http : //www. ibm. com" 

xmlns : xsi= " ht tp : / /www . w3 . org/2 0 0 1 /XMLSchema- instance " 
xsi :schemaLocation= "http: //www. ibm. com wmqi21Example .xsd 11 > 
dummy_elem__string_inheritence 
20 <elem3 >elem3 < /elem3 > 

< /wmqi2 lExample : mrmChoiceElement > 

Note that both of the above instance documents are valid because the mixed flag was set to 
true on their respective complex types. 

25 

The figure illustrates an example system employing this invention. A customized parser 10, 
(which may be any known parser capable of handling the W3C XML schema, modified to 
operate as aforedescribed) has available to it the extended W3C XML schema 12. The 
extended W3C XML schema extends the standard W3C XML schema with the aforenoted 
30 customized simple types. In operation, an XML stream 14 inputs the parser 10, causing the 

parser to generate events 16. The customized parser 10 may be a processor loaded with 
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software from a computer readable medium 20, such as a diskette, CD-ROM, memory chip, 
or a file downloaded from a remote source. 

Rather than wrapping elements of the custom simple types in a group with minOccurs and 
5 maxOccurs set to zero in order to ensure that a "regular" XML parser (i.e., an XML Schema 

compliant parser which is not configured to operate in accordance with this invention) will 
ignore these elements, an alternate approach may be used. More specifically, the minOccurs 
and maxOccurs of each of these elements is set to zero. Thus, a "regular" parser will ignore 
these elements. However, an annotation statement (which will also be ignored by a "regular" 
1 0 parser) is employed to give the parser of this invention the information as to the "real" 

minOccurs and maxOccurs of each of these elements. The customized parser recognizes this 
information by virtue of the use of an "appinfo" statement in the annotation. 

This pattern is used for a complex type that has the content model set to "all" group (which 
15 does not allow sequence or choice group elements). In this case the dummy element is 

embedded inside "all" group. 

The dummy element pertaining to the inheritance of simple type is best placed as the very 
first element in the complex type. For the complex type having the content model "choice", 
20 the top level content model is set to sequence, the dummy element pertaining to inheritance 

of simple type is best placed as the first element, and the actual choice content follows the 
dummy element, as illustrated in the type 

MRMChoiceExtendingBaseTypeAndlncludingSimpleType in the folllowing example. 

25 

<?xml version="l. 0" encoding="UTF-8"?> 

<schema xmlns="http: //www.w3 . org/2 001/XMLSchema 11 

targe tNamespace= " htp : / / www , ibm . com 11 
mlns : wmqi2 lExample= " ht tp : / /www . ibm . com " > 

30 

<annotation> 
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< document at ion> 

MRMSequenceExtendingBaseTypeAndlncludingSimpleType 

MRMChoiceExtendingBaseTypeAndlncludingSimpleType 
they extend from a simple type String and it 

include simple type Integer 
< /documentation 

</annotation> 

<include schemaLocation="wmqi21 .xsd" /> 
<complexType 

name= "MRMSequenceExtendingBaseTypeAndlncludingSimpleType " 
mixed= "true 11 > 

<sequence> 

<element name="duirany_elem_string_inheritence" 
type="wmqi21Example : ComIbmMrm_BaseValueString" 
minOccurs=="0" maxOccurs="0" /> 

<element name= n eleml" type=" string" /> 

<element name= "dummy_elem_included_type_int " 
type= "wmqi21Example : ComIbmMrm_AnonInt " minOccurs= " 0 " 
maxOccurs= " 0 " > 

<annotation> 
<appinfo 

source= "WMQI_APPINFO n >MinOccurs==0 , MaxOccurs=l</appinf o> 
< / annot at ion> 
< /element > 
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<element name="elem2 11 
type= 11 integer ">< /element > 

</sequence> 
< / compl exType> 

< comp 1 exType 

name="MI^ChoiceExtendingBaseTypeAndIncludingSimpleType 
mixed= "true "> 

<sequence> 

<element 

name="dummy_elem_string_inheritence" 

type= "wmqi21Example : ComIbmMrm_BaseValueString" 

minOccurs= 11 0 " maxOccurs= 11 0 ">< /element > 

<choice> 

<element name="elem3 " type=" string" /> 
<element 

name= "dummy_elem_included_type_int " 

type= 11 wmqi21Example : ComIbmMrm_AnonInt 11 minOccurs= " 0 " 
maxOccurs= " 0 " > 

<annotation> 

<appinf o>MinOccurs=0 , MaxOccurs=l</appinf o> 

< / anno t a t i on > 
</element> 

<element name="elem4 " 
type= 11 integer ">< /element > 
</choice> 
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</sequence> 
< / complexType > 

<element name= "mrmSequenceElement " 

type= " wmqi2 lExample : MRMSequenceExtendingBaseTypeAndlncludin 

gS imp 1 eType 11 >< / e 1 emen t > 

<element name= n mrmChoiceE lenient" 
type= "wmqi2 lExample :MRMChoiceExtendingBaseTypeAndIncludingSimp 
leType" >< /element > 

</schema> 

As an alternative to providing each custom simple type with a name that will be recognized 
by the custom parser as indicating one of an embedded simple type or an inherited simple 
type, the name of the dummy element can be used for this purpose. Thus, there would be a 
set of names for dummy elements, each name representing one custom simple type. 

Other modifications will be apparent to those skilled in the art and, therefore, the invention is 
defined in the claims. 
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